Rank in Wordlist | Frequency | Word |
---|---|---|
2641 | 17 | nie,” |
3409 | 13 | word,” |
6575 | 5 | 1,5 |
7201 | 5 | is,” |
8638 | 4 | het,” |
8820 | 4 | maak,” |
9385 | 4 | wees,” |
9427 | 3 | 1,6 |
9482 | 3 | 2,5 |
11136 | 3 | hou,” |
Rank in Wordlist | Frequency | Word |
---|---|---|
3198 | 13 | 100% |
3913 | 10 | 10% |
4249 | 9 | 50% |
4250 | 9 | 60% |
5101 | 7 | 5% |
5746 | 6 | 1% |
5772 | 6 | 40% |
6591 | 5 | 20% |
6595 | 5 | 25% |
6597 | 5 | 30% |
Rank in Wordlist | Frequency | Word |
---|---|---|
4632 | 8 | B&B |
8139 | 4 | S&P |
12897 | 2 | B&Bs |
14759 | 2 | V&A |
19293 | 1 | 1&2 |
21650 | 1 | B&B- |
21651 | 1 | B&B-akkommodasie |
21652 | 1 | B&B-verblyf |
22381 | 1 | Bride&Co |
24901 | 1 | H&N |
Rank in Wordlist | Frequency | Word |
---|---|---|
28037 | 1 | N$2 |
31898 | 1 | US$385 |
Rank in Wordlist | Frequency | Word |
---|---|---|
402 | 113 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
2603 | 17 | foto's |
7765 | 4 | 70's |
10181 | 3 | Prince's |
11218 | 3 | jy't |
11382 | 3 | ma's |
12420 | 2 | .' |
13159 | 2 | Dan's |
13267 | 2 | Ek's |
13363 | 2 | Foto's |
13686 | 2 | Jy's |
Rank in Wordlist | Frequency | Word |
---|---|---|
20436 | 1 | 401+-groep |
Rank in Wordlist | Frequency | Word |
---|---|---|
1792 | 26 | en/of |
2781 | 16 | hy/sy |
2830 | 16 | sy/haar |
4874 | 8 | hom/haar |
4877 | 8 | https://www |
8891 | 4 | o/19 |
9483 | 3 | 2/3 |
11523 | 3 | o/8 |
12088 | 3 | ton/ha |
12423 | 2 | 0/15 |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others are not. If words containing forbidden characters occur with more than very low frequency, this may indicate a problem in preprocessing.
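The check described here can be sketched in a few lines of Python. This is only an illustration, not the code used to produce the report: the function name `flag_suspicious`, the `allowed` set and the `min_freq` threshold are hypothetical, and the sample rows are copied from the tables in this section.

```python
# Characters examined in this subsection.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(rows, allowed, min_freq=5):
    """Return (freq, word, chars) for words that contain a special character
    outside `allowed` and occur at least `min_freq` times.

    `rows` is an iterable of (freq, word) pairs. `allowed` and `min_freq`
    are illustrative assumptions and have to be chosen per language.
    """
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for freq, word in rows:
        # Special characters in this word that are not allowed.
        hits = sorted(set(word) & forbidden)
        if hits and freq >= min_freq:
            flagged.append((freq, word, "".join(hits)))
    return flagged


# Sample rows copied from the tables in this section.
sample = [(17, "nie,”"), (13, "100%"), (26, "en/of"), (17, "foto's"), (1, "N$2")]

# Assumption: apostrophe, slash and percent sign are acceptable inside words.
result = flag_suspicious(sample, allowed="'/%")
print(result)
```

Under these assumed settings only the comma case is reported: `N$2` contains a forbidden character but falls below the frequency threshold, which corresponds to the "very low frequency" exception above.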
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
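The same query pattern can be reused for the other characters. The sketch below runs it against an in-memory SQLite table rather than the original database, so several details are assumptions: the helper `words_containing` is hypothetical, the rows are a handful copied from this section, and their `w_id` values are set to rank + 100 to mirror the `w_id-100` offset in the query. An explicit `ESCAPE` clause is used because SQLite, unlike MySQL, has no default escape character for `LIKE`, and an `ORDER BY` is added only to make the output deterministic.

```python
import sqlite3


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows whose word contains `char` literally."""
    # Escape the LIKE wildcards (% and _) and the escape character itself.
    escaped = char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    query = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(query, (f"%{escaped}%", limit)).fetchall()


# Illustrative table with a few rows from this section (w_id = rank + 100).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (1892, "en/of", 26),
        (3298, "100%", 13),
        (4013, "10%", 10),
        (4732, "B&B", 8),
        (28137, "N$2", 1),
    ],
)

for char in "%&$/":
    print(char, words_containing(conn, char))
```

Escaping matters for `%` and `_` in particular: without it, `_` would match any single character and every word would be returned.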
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots